Overview of the NTCIR-12 Short Text Conversation Task

نویسندگان

  • Lifeng Shang
  • Tetsuya Sakai
  • Zhengdong Lu
  • Hang Li
  • Ryuichiro Higashinaka
  • Yusuke Miyao
چکیده

We describe an overview of the NTCIR-12 Short Text Conversation (STC) task, which is a new pilot task of NTCIR-12. STC consists of two subtasks: a Chinese subtask using post-comment pairs crawled from Weibo, and a Japanese subtask providing the IDs of such pairs from Twitter. Thus, the main difference between the two subtasks lies in the sources and languages of the test collections. For the Chinese subtask, there were a total of 38 registrations, and 16 of them finally submitted 44 runs. For the Japanese subtask, there were 12 registrations in total, and 7 of them submitted 25 runs. We review in this paper the task definition, evaluation measures, test collections, and the evaluation results of all teams.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

BUPTTeam Participation in NTCIR-12 Short Text Conversation Task

Abstract This paper provides an overview of BUPTTeam’s system participated in the Short Text Conversation (STC) task of Chinese at NTCIR-12. STC is a new NTCIR challenging task which is defined as an IR problem, i.e., retrieval based a repository of postcomment pairs from Sina Weibo. In this paper, we propose a novel method to retrieve post result from the repository based on the following four...

متن کامل

Nders at the NTCIR-12 STC Task: Ranking Response Messages with Mixed Similarity for Short Text Conversation

Short Text Conversation (STC) is a typical scenario in manmachine conversation, which simplifies the conversation into one round interaction and makes the related tasks more practical. This paper presents a simple approach to the Chinese STC task issued by NTCIR-12. Given a repository of post-comment pairs, for any query, we define three types of similarity and merged them according to empirica...

متن کامل

Analysis of Similarity Measures between Short Text for the NTCIR-12 Short Text Conversation Task

According to rise of social networking services, short text like micro-blogs has become a valuable resource for practical applications. When using text data in applications, similarity estimation between text is an important process. Conventional methods have assumed that an input text is sufficiently long such that we can rely on statistical approaches, e.g., counting word occurrences. However...

متن کامل

OKSAT at NTCIR-12 Short Text Conversation Task: Priority to Short Comments, Filtering by Characteristic Words and Topic Classification

Our group OKSAT submitted five runs for Chinese and Japanese subtasks of the NTCIR-12 Short Text Conversation task (STC). We searched not only posts but also comments for terms of each query (post). We also gave more priority to short comments than longer ones. Then we filtered retrieved comments by characteristic words including proper nouns. We added attributes to the corpus and also to the q...

متن کامل

SLSTC at the NTCIR-12 STC Task

The SLSTC team participated in the NTCIR-12 Short Text Conversation (STC)[1] task. This report describes our approach to solving the STC problem and discusses the ocial results.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016